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Abstract. 

Consider entanglement concentration schemes that convert n identical copies 
of a pure state into a maximally entangled state of a desired size with success 
probability being close to one in the asymptotic limit. We give the distillable 
entanglement, the number of Bell pairs distilled per copy, as a function of an error 
exponent, which represents the rate of decrease in failure probability as n tends 
to infinity. The formula fills the gap between the least upper bound of distillable 
entanglement in probabilistic concentration, which is the well-known entropy of 
entanglement, and the maximum attained in deterministic concentration. The 
method of types in information theory enables the detailed analysis of the 
distillable entanglement in terms of the error rate. In addition to the probabilistic 
argument, we consider another type of entanglement concentration scheme, where 
the initial state is deterministically transformed into a (possibly mixed) final state 
whose fidelity to a maximally entangled state of a desired size converges to one 
in the asymptotic limit. We show that the same formula as in the probabilistic 
argument is valid for the argument on fidelity by replacing the success probability 
with the fidelity. Furthermore, we also discuss entanglement yield when optimal 
success probability or optimal fidelity converges to zero in the asymptotic limit 
(strong converse), and give the explicit formulae for those cases. 



PACS numbers: 03.67.-a, 03.67.Hk, 03.65.Ud 
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1. Introduction 

Quantum entanglement, an indispensable resource for quantum information processing 
such as superdense coding jD, quantum teleportation [|| , quantum cryptography |3| , 
and quantum computation [Ej, is expected to have a rich mathematical structure 
behind its weirdness. As in the case of other physical resources, quantification of 
entanglement is the key to understanding its full potential. The essentials of bipartite 
pure-state entanglement have already been revealed for both finite regimes and the 
asymptotic limit. The fundamental results are the intimate connection between the 
mathematical theory of majorization and entanglement manipulation [p[ pj 7, B[, and 
the existence of a unique measure of entanglement in the asymptotic limit [ 9 , lCfl . 
One way of quantifying entanglement is to estimate the number of Bell pairs, 

-L(|00) ab + |11>ab), (1) 

distilled from a given entangled state by local operations and classical communication 
(LOCC). Though the above quantity of distillable entanglement can be defined for 
mixed states, we deal with only pure states here. In order to make use of partially 
entangled states for quantum teleportation, we need to convert the partially entangled 
states into maximally entangled states by LOCC. The process is called entanglement 
concentration, and its efficiency in the asymptotic limit is the focus of this paper. 

The unique measure of bipartite pure-state entanglement gives the limitation on 
the efficiency of entanglement concentration. Suppose we share n identical copies of 
a partially entangled state 
d 

\<t>)=J2vp~i\m, (2) 

where the Schmidt coefficients squared are arranged in decreasing order, i.e., p\ > 
p 2 > . . . > p d > o, and sum to one. (Schmidt coefficients are arranged in decreasing 
order throughout this paper.) Bennett et al. || proved that the maximum number of 
Bell pairs distilled per copy from |</>)® n is given by 



d 



^entropy (0) = ~ V Pi log 2 Pi (3) 



i=l 



in the asymptotic limit, n — > oo. (Logarithms are taken to base two throughout 
this paper unless stated otherwise.) They imposed the condition that the success 
probability of entanglement concentration tends to one in the asymptotic limit, i.e., 

^success 1 £3 (4) 

where 

e — > as n — > oo. (5) 

With this restriction, the maximum attainable entanglement yield is proven to be 
equation (||). 

On the other hand, much research on entanglement concentration has been 
undertaken from various viewpoints j(| [?], |[ |HJ |T^, Among other things, 

the bound on entanglement yield in deterministic concentration []l3[ 

Edct{4>) = -logpi (6) 
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gives another quantification of entanglement. The restriction deterministic means that 
the process succeeds with probability one both in finite regimes and in the asymptotic 
limit. 

Though the quantities ©entropy and ©dot give entanglement yield in the asymptotic 
limit, where both processes succeed with probability one, the two quantities do not 
coincide. The main purpose of this paper is to find out the reason for the discrepancy. 
We will see that it is caused by the difference of the rate at which failure probabilities 
decrease when n tends to infinity in both concentration processes. Roughly speaking, 
while we obtain -E en tropy when failure probability decreases slowly, we obtain £^et when 
it decreases rapidly. We will represent the rate by the exponent of failure probability in 
the asymptotic limit (error exponent) . This is a common approach in the information 
sciences, and will allow us to 'tune' between the two extremes just mentioned. The 
quantum fixed-length pure state source coding in reference |15| is another example 
that uses the notion of error exponents in quantum information theory. 

In the derivation of ©entropy, we use the asymptotic equipartition property pq| . 
However, a detailed analysis of the asymptotic behaviour requires more powerful 
mathematical tools; namely, the method of types jl6[ [TtJ, which makes it possible 
to calculate the probabilities of rare events and derive stronger results than when we 
focus only on typical sequences. 

The argument via the method of types will give entanglement yield as a function of 
an error exponent and reveal the missing link between ©entropy and ©det- In addition, 
we will also see that the success probability exponentially decreases when we try 
to distil more entanglement than -Ecntropy (strong converse). This was observed in 
reference [[n], but here we are able to derive the exact error rate. 

It was suggested that the two extremes -©entropy and ©dot can also be expressed 
as some limits of Renyi entropy We will also see that they can be linked by 

Renyi-entropy-like functions, which is useful for practical calculation. 

We can consider two different ways of analyzing entanglement concentration: One 
is estimating the optimal success probability of obtaining the exact copy of a maximally 
entangled state of a desired size, where we discard failure cases. The other is estimating 
the fidelity of the final state, which is generally a mixed state, to a maximally entangled 
state of a desired size, where we obtain the final mixed state with probability one. 
We will first derive the entanglement yield as a function of an exponent of failure 
probability (error exponent) under the condition that the optimal success probability 
converges to one in the asymptotic limit. Then, we will also derive the entanglement 
yield as a function of an exponent of one minus fidelity, under the condition that the 
optimal fidelity converges to one in the asymptotic limit. The argument on fidelity 
can be reduced to that on probability via some lemmata. Finally, the yield functions 
will turn out to be in the same form in the both cases. Since transformations assumed 
in the argument on fidelity need not produce the exact copy of a maximally entangled 
state, the treatment is more natural from a physical perspective than that on optimal 
success probability. In finite regimes, deterministically transforming a pure state into 
a maximally entangled state with optimal fidelity has been discussed in reference |L8[ , 
while we will consider that in the asymptotic limit. The strong converse will also be 
analyzed in terms of fidelity. 

This paper is organized as follows. We begin in section [2] by revisiting the 
entanglement concentration in finite dimensions. After a brief review of the method of 
types in section^, we will move on to the main result of this paper, error exponents and 
asymptotic entanglement concentration, in section 0. Section discusses the strong 
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converse. In section g, we will present alternative formulae for the result proven in the 
preceding sections. Finally, we will discuss interpretations of our result and of some 
properties of the yield function, then conclude the paper. Some lemmata used in the 
proofs of our results are presented in Appendices. 

2. Finite-dimensional entanglement concentration revisited 

In this section, we revisit entanglement concentration of finite-dimensional states so 
that the asymptotic limit will be smoothly derived from it. Suppose we distil a 
maximally entangled state with Schmidt number L(< d), 



v »=i 

from the partially entangled state with Schmidt number d, equation (|^). Lo and 
Popescu pi) derived the optimal probability with which we distil from \<j>): 



Note that the Schmidt number of the initial state cannot be expanded by LOCC. 
Thus, we cannot distil a maximally entangled state with the Schmidt number greater 
than d. 

In the following, we reformulate the optimal probability Pl in a more suitable 
form for our treatment of asymptotic entanglement concentration. Our strategy for 
distilling a maximally entangled state consists of two parts: 

(i) We perform a two-valued local measurement to change the initial distribution of 
the Schmidt coefficients (probabilistic part). 

(ii) If a desired result is obtained in the above measurement, we distil a maximally 
entangled state from the resultant state with probability one (deterministic part). 

First, we briefly review the second (deterministic) part, which was investigated 
in reference [ fL3| . Suppose we wish to distil a maximally entangled state with the 
greatest possible Schmidt number from \<p) with probability one. The maximum 
Schmidt number of the maximally entangled state is |_1 /pij , where \_x\ represents the 
largest integer equal to or less than x. Thus, if we could make pi smaller somehow, 
the size (Schmidt number) of the resultant maximally entangled state would become 
greater. Note that, according to Nielsen's theorem |5j, the largest Schmidt coefficient 
cannot be deterministically decreased by LOCC. 

In order to distil a maximally entangled state of size L > [1 /p±\ , we need to adjust 
the largest Schmidt coefficient of the initial state before moving on to deterministic 
entanglement concentration in the second part. So, in the first (probabilistic) part, 
we perform the measurement presented below to truncate the initial distribution of 
the Schmidt coefficients. 

As shown in figure 0, first we draw a truncating line that represents probability t, 
which is uniquely determined by the size of the maximally entangled state as proved 
later. Then we perform a two- valued local measurement on either side of the entangled 
pair by using measurement operators, 




(7) 




(8) 




(9) 
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with pi > p 2 > ■■■ > Pi*-i > t > pi* > ••• > Pd, and M 2 such that 
M^Mi + m\m 2 = I. Measurement outcome 1 corresponds to the success of the 
entanglement concentration. It is easily seen that it occurs with probability 

d 

Success =|| (Mi®/) |0) || 2 =^min{t, K }, (10) 

i=l 

which is shown schematically as the area of the shaded region below the truncating line 
t in figure [l]. Measurement outcome 2 corresponds to the failure of the concentration, 
and the failure probability, 1 — -Psucccss, is equal to the area of the region above the 
truncating line in figure H. 



probability 
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r i 








truncating line 
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O 1 2 l*-l 1* d-l d 1 



i=l» 



Figure 1. The distribution of the squared Schmidt coefficients of a partially 
entangled state to be concentrated. Each bar corresponds to a Schmidt coefficient 
squared. The area of the shaded region below the truncating line f represents the 
success probability of entanglement concentration. 

In the case of measurement outcome 1, the post- measurement state becomes 



y/ /success y/si 

whose Schmidt coefficients squared are represented as the shaded region in figure |] 
with an appropriate renormalization; Each bar (Schmidt coefficient) is replaced with 
that divided by P S uccess- Since the largest Schmidt coefficient of the post-measurement 
state is -y/£/.P SUCC0SS , we can distil a maximally entangled state of size 
p 

j * success ( ~\ 

(12) 

In other words, our scheme distils a maximally entangled state of size / with 
probability P S ucccss- The truncating probability t is uniquely determined by a given 
integer / via an implicit function 

Z = ]T>in{l,!}, (13) 



Error exponents for entanglement concentration 



G 



which is given by equations ( |l0| ) and (|T^) . Since the right-hand side of equation ( [i"3| ) 
is strictly monotone decreasing in [1/pi, d] for t G [pd,Pi], £ is uniquely determined for 
a given integer L G [l/pi,d]. (When L < 1/pi, \&l) can be distilled with probability 
one. Thus, we do not need the probabilistic part.) 

The rest of this section gives the proof that the probability P S ucccss coincides 
with the optimal probability Pl [equation (||)]. The truncating probability t uniquely 
determines an integer I* satisfying pi* _i > t > pi* . (See figure |l|.) From equations ( |To| ) 
and @, we have 



= t(l* -l) + Y J Pi=tL. 



(14) 



i=i* 



Thus, 



L-l* 



Substituting this into equation 
with I* and L: 



(in 



(15) 



we obtain an alternative expression of P s 



L-l* 



d 



Pi- 



success 



(16) 



On the other hand, equation (|14| ) gives t(l* — 1) < fL, i.e. 



due to the right-hand side of equation (| 



I* G [l,L]. Thus, we have 
which means the success 



probability of our scheme is optimal because Pl has already been proven optimal. 

Therefore, we obtain the following equation that connects the size of a maximally 
entangled state L, the truncating probability t, and the optimal success probability of 
concentration Pl'. 



Pl = tL. 



(17) 



3. The method of types 

In order to analyze entanglement concentration in the asymptotic limit, we will employ 
the method of types in the following sections. In this section, we briefly summarize 
relevant definitions and lemmata on the method of types without giving the proofs. 
For detailed discussions and proofs, see chapter 12 in reference jl6| and chapter 1 in 
reference [jl7| . 

Let Xi,X2,---,X n be a sequence of n symbols from an alphabet A = 
{01,02, • • • , ad}, where d is the number of symbols in the alphabet A. A sequence 
x\Xi ■ ■ ■ x n will be denoted by x. 

Definition 1 The type p x of a sequence x\X2 ■ ■ ■ x n is the relative proportion of 
occurrences of each symbol of A, i.e., 

, , N(a\x) , . 

pja) = y 1 ; for all a G A. (18) 
n 

where iV(a|x) is the number of times the symbol a occurs in the sequence x G A n . 
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A type p x is a map from symbols a € A to their frequencies in the sequence x 
(empirical probability distribution). We denote the set of types with denominator n 
by V n . If p G V n , then the set of sequences of length n and type p is called the type 
class of p, denoted by T™, i.e., 

r; = {xei> x = P }. (19) 

In other words, n-letter sequences that coincide when rearranged in alphabetical order 
are in the same type class. 

Lemma 2 

\V n \<(n+l) d . (20) 

Though the number of sequences in A n is exponential in n, the number of types 
grows at most polynomially in n, which means that at least one type has exponentially 
many sequences in its type class. 

If each letter in sequences is drawn i.i.d. according to some probability 
distribution, then we can estimate the probability with which a sequence occurs by 
using the Shannon entropy H(p) and the relative entropy D{p || q), i.e., 



H{p) = -J2Pi l °SPi, (21) 

i=l 



and 



0(p||«)=y>log^, (22) 

i=l qi 

where p and q are probability distributions. 

Lemma 3 If X\, X%, — , X n are drawn i.i.d. according to q(x), then the probability 
of x depends only on its type and is given by 

<f(x) = 2- n{H{Px)+D{pM} . (23) 

Furthermore, we can also estimate the size of a type class T£\ the number of the 
sequences in T™ is bounded as follows: 

Lemma 4 For arbitrary type p € V n , 

1 onff(p) < \rpn\ < nnff(p) (r,A\ 

(n + 1)* ^\ i p\^ z ■ W 

From Lemmata |3| and ^, we obtain the bound of the probability of a type class. 

Lemma 5 For arbitrary type p G V n and arbitrary probability distribution q, the 
probability of the type class T" under q n is bounded as 



1 o-nD(p\\q) < n n/rpn\ < n-nD(p\\q) 

{n+iy S9t ^- 



(25) 



The lemmata summarized above are also shown schematically in figure ^ for easy 
reference in the proofs of our results, where the method of types will be heavily used. 



Error exponents for entanglement concentration 

probability 



8 



q"(x) 



type p 
i 



q"(T P ") 



O 



\T P '\ 



-tL 



Figure 2. A schematic summary of the method of types. This bar graph 
represents the probability distribution on n-letter sequences that are drawn 
i.i.d. according to q(x). Each bar corresponds to a type class, and consists 
of the sequences of the type. The number of 'stairs' is the number of types 
Vn : \P n \ < (n + The height of each bar is the probability with which 

each sequence in the type class occurs: <j n (x) = 2 _n ' ir ' Px ' +D ' Px ll 9 " . The width 
of each bar is the number of sequences in the type class: (n + l)~ d 2 nH ^ < 
|T™| < 2 nH ( p }. The area of each bar is the probability of the type class: 
(n + l)-d2- nD (Ptti) < g n (T") < 2- nD ^i'> . (Note that if q(x) degenerates, 
i.e., q(xi) = q(xj) for some Xi and Xj such that i ^ j, then sequences in different 
type classes can occur with the same probability. Thus, one bar can consist of 
different type classes.) 



4. Asymptotic entanglement concentration 

This section presents the main result of this paper, asymptotic entanglement 
concentration from the viewpoint of error exponents. Suppose we wish to distil a 
maximally entangled state of size L n from n identical copies of \<j>), i.e., \<f))® n = 
Si \/p™~P)|i)|i)> where p n (i) is the n-i.i.d. extension of pi. Applying the results in 
section ^ to the n-i.i.d. case, we also have the optimal success probability 

P iB = 5^min{t n ,p n (i)} J (26) 

i 

and the relation between Pl„, L n , and the truncating line t n , 

P Ln = t n L n . (27) 

In the following, we consider the case where the optimal success probability Pl„ 
converges to one as the number of entangled pairs n increases. The rate of the 
convergence is represented by an error exponent r, the first-order coefficient in the 
exponent of the failure probability in the asymptotic limit, which is defined as 

r= lim {--log(l-P Ln )\. (28) 



Intuitively, this means that the error probability behaves as 2 
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We will derive the maximum number of Bell pairs distilled per copy in the 
asymptotic limit, E, as a function of the error exponent r. First, we prove a theorem 
that relates entanglement yield and an error exponent via a monotone function, from 
which we will derive a formula for entanglement yield E(r). 

Theorem 6 Consider a sequence of entanglement concentration schemes converting 
n identical copies of \<p) = 53j=i v^* I *' e '' I'W 8 ™; a maximally entangled 

state of size L n , which attain the optimal success probability Pl„- Suppose 

limsup [ - logi n ] < H(p), (29) 



n — >oo 



r> 



and 



-logi„ > -logpi, (30) 
n 



where p = (pi, • • • ,Pd)- Then, 



lim sup ( - log L n )=f (lim inf { - - log(l - P Ln ) } ) , ( 8 1 ) 



and 



where 



lim inf ( - logO = / (limsup (-- log(l - P L J }> I , (82 ) 



n^oo \ n J \ n^oc fl 



f(r)= min {D(q \\ p) + H (<?)} . (33) 

<j:-D(g||p)<r 



fl„ = --logt n . (34) 



Proof. Let 



Then, equation (|27|) gives 

Rn= - log L n - - log P Ln . (35) 
n n 

In what follows, we only consider a convergent sub-sequence of {R n }, that is, 
take an infinite subset J\f C {1,2,...} such that R' = lim„^ 00 .„ e 7V' Rn exists. For 
simplicity, we denote the sub-sequence {R n }neAf as {R' n }, and omit n 6 Af . Since 
equation (|2^) implies lim JWOO P^ n — 1, equation (|3^) gives 

lim (-logL;) =R', (36) 

where denotes the sub-sequence that corresponds to {R' n }. Then, equations ( p9| ) 
and ( |30| ) imply 

- logpi < R' < H(p). (37) 
Equation (pq) gives 



- - log(l - P L ,J = -- log [1-2 min{4,p" (i)} ) 



(38) 



where {t' n } is a sub-sequence such that R' n = —n 1 logi^. 
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In the following, we estimate the right-hand side of equation ( pq ) by the method 
of types. Rewriting the area above the truncating probability t' n in terms of the type 
theory, we obtain 

l-5>in{Cp"(i)}= ]T (^(T»)-|TX)- (39) 

i ieV n :p n (q)>t' n 



Invoking equations (|20j), fl23|), and(|25|), we have the inequalities 
E (p n (2?) - |TX) < E ^ T <D 

q£V n :p n (q)>t' n q€V n :p" (q)>t' n 



<(n + l) d max 2 - nI5 ^^. (40) 

?e-P„:£>(9||p)+H(g)<i^ 



Note that p n (q) > t' n is equivalent to D(q \\ p) + H(q) < R' n due to equations fl23| ) and 
(|4j). Thus, together with equations (|3^) and ([59|), we have 

--log(l-Pz/) > --log(n + l) + min D(q\\p). (41) 

n ™ TI qeV n :D(q\\p)+H(q)<R' n 

Therefore, for — logpi < R' < H(p), 

liminf j--log(l - P L ol > min D (q II p). (42) 

n^oo { U " J q:D(q\\p)+H(q)<R> 

Next, first we consider the case — logpi < R 1 . Then, for any q satisfying 

-log Pl < D(q\\p) + H(q)<R', (43) 
there exists a sequence of types q' n € V n such that 

D(q' n || p) + H(q' n ) < R' n with lim q' n = q. (44) 

n—>oo 

Invoking equations ( |24| ) and (|25[), we also have 

qeV n :p™(q)>t' n 

> ^ o - "- ^"^^ — 2 nH ( q '^2~ nR ' n (45) 
~ (n + l) d ' y ' 

Thus, together with equations |I^) and ([39j), we have 

- - log(l -P L ,)< -- log (2- n (^"HP>+£ M"+i)) - 2-™«- ff ^"))) . (46) 
Equations (pf) and (ph imply 



lim \D(q' n || p) + - log(n + 1)1 < lim « - ff(O) . 



(47) 



Applying Lemma FTI] in Appendix A to equation (Eft) , we obtain 



limsup { -- log(l -Pv)\< D(q || p), (48) 



which holds for any q satisfying equation. (|43|). Therefore, for — logpi < R' < H(p), 
limsup {-- log(l - P L ,)\ < inf D(q || p) 

n-cx, 1 n "'J ~ 9 :D(g||p)+ff(g)<_R' 

= min D(<7 || »). (49) 

q :D(q\\p)+H(q)<R> 
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Equations ( |42| ) and ( |49| ) show that when — logpi < R' < H(p), the sub-sequence 
{— n^ 1 log(l — P^i )} is also convergent and 

r' = Urn 1 log(l - P L , )\ = min D(q II »). (50) 

™^oo [ n " J g:-D(</lb)+ff(<?)<fl' 

According to Corollary |l6| in Appendix B . setting S(q) = D(q \\ p) + H(q), 
U{q) = D(q || p), x = R, and noting that x\ = — logpi, and X2 — H(p), we 
see that the function R i— » vnm q -.D(q\\p)+H(q)=R D(q \\ p) is continuous and strictly 
monotone decreasing in (0, — logpi) for R G (— logpi, H{p)). Thus, so is the function 
R i — > mhiq.^^Hp)^^).^ P(q || p), which is the inverse function of f(r) [equation (p3|)] 
(see figure ||) . Therefore, equations (^) and (|5^) provide 



i.e., 



for 



lim (-ilog(l-P i; )} = /- 1 ( lim (ilogX; 
hm (- \ogL' n ) = f ( lim (-- log(l - P L , ) 



(51) 
(52) 



and 



lim { -ilog(l-P L J } »Iuk/„. (•-»:-!) 

-logp 1 < lim (ilogP;) <H(p). (54) 
On the other hand, when R' = — logpi, equations (^) and (^|) give 

lim ( - log L' ) = - log pi , (55) 

and 

liminf { log(l — P[/ ) \ > — logpi. (56) 

n— >oo [ n "J 

In addition, for r > — logpi, 

f(r)= min {£>(<? || p) + H(q)} = - log Pl . (57) 

q:D{q\\p)<r 

Note that D(q \\ p) + H(q) = — Yli=i 1i ^°&Pi > — logpi with equality when 
q = (1,0, • • • , 0), i.e., D(q \\ p) = - logp x . Hence, 

- logpi = lim ( - logi^ ] = / (liminf { -- log(l - P L , 

n— too \n J \ n— >oo In 



= / (lim sup \--log(l- P L ,j) 



(58) 



limmN-ilogtl-P-) } >-log Pl . (:•!)) 



for 

liminf < - 

The above argument holds for any convergent sub-sequences {R' n }, and f(r) is 
monotone decreasing. Therefore, equations (|52]), (^), (|54|), (|58|), and ( |59| ) provide 
equations ([H]) and (^) (see figure ||) . □ 

Theorem || leads to the following corollary, which gives the maximum asymptotic 
entanglement yield E(r) under the requirement that the failure probability decreases 
as rapidly as 2~ nr : 
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Corollary 7 Consider a sequence of entanglement concentration schemes converting 
\(f))® n into a maximally entangled state of size L n with success probability Psuccess, 
such that 



r<liminf(-ilog(l-Pj u t css )) 



(60) 



Let us denote the class of all such sequences by C(r). Then, for r > 0, 



E(r) = maxlimsup — logL, 

C(r) n->oo \n 



max lim inf — log L, 

C(r) n—>oc \n 



min {D(q \\ p) + H(q)} . 

q-D(q\\p)<r 



(61) 



This corollary connects the following two facts on the distillable entanglement of 
bipartite pure states E: 

(i) If we allow error probability that vanishes in the asymptotic limit, E cannot 
exceed H(p) §. 

(ii) If we stick to deterministic strategy even in the finite regimes (i.e., no error is 
allowed), E is equal to — logpi 

Equation ( |6l| ) provides the missing link between them, i.e., -Eontropy = H(p) = 
lim r ^Q E(r) and E^ot — ~logpi = linir^oo E(r). The distillable entanglement 
E{r) monotonically decreases and reaches — logpi when r = — logpi, which means 
that probabilistic concentration schemes with error exponents greater than — logpi 
effectively give the same result as deterministic ones. 




log Pi 



-log;?; 



Figure 3. Entanglement yield in asymptotic entanglement concentration with an 
error exponent r. The horizontal axis represents the error exponent. The vertical 
axis represents the number of Bell pairs distilled per copy in the asymptotic limit: 
E(r) = min 9 . D(9 || p) < r {D(q )| p) + H{q)}. 



Next, we move to a discussion about entanglement yield and fidelity Suppose 
we wish to transform the initial state \(fi)® n deterministically into some final (possibly 
mixed) state that is as close to a maximally entangled state of size L n as possible. 
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Instead of the success probability Psulcess in the previous argument, we here require 
that the fidelity F n between the final state and the maximally entangled state |$l„) 
approach unity as rapidly as 1 — 2~ nr , namely, 

r<limmf|--log(l-F n )i. (62) 

n-+oo ^ n J 

Let us denote the class of all such sequences by Cf{t). The maximum asymptotic 
entanglement yield Ep{r) over Cf(t) can be obtained by reducing the problem to that 
of probabilistic schemes via the following two lemmata. 

Lemma 8 // the transformation \<j)) > is possible with probability 1 — e, then 

there exists a deterministic transformation \<p) — > p with fidelity {<5>l\p\<&l) > 1 — e. 

The proof is straightforward by considering the case p = (1 — e)| $z,)($l | + ep'. This 
lemma implies that for any sequence of probabilistic schemes belonging to C(r) with 
size {L n }, there exists a sequence belonging to Cf{t) with the same size {L n }. Hence, 

max lim inf f — log L n ) > E(r). (63) 

C F (r) rwoo V" / ~ 

Lemma 9 // there exists a deterministic transformation \(f>) — > p with fidelity 

> 1 — e, the transformation \<f>) > * s possible with probability 1 — 6e, 

forL= L T (!-6e)/6j. 



Proof. See Appendix C 



□ 



This lemma implies that for any sequence of deterministic schemes belonging to Cf(t) 
with size {L n }, there exists a sequence belonging to C(r) with the size {L' n }, where 
L' n > L n /7 for large n. Hence, 



maxlimsup [ — logL„ ) < maxlimsup | — \og(7L' n ) 1 = E(r) 

C F (r) n->oo \n ) C(r) „^oo [n J 



(64) 



From Eqs (|6^) and (|64|), we have 

Ep{r) = maxlimsup [ — logi„ ) = max (r), lim inf ( — logL„ ) = E (65) 

Cf(t) n^oo \n J C P (r) n-»oo \n J 

which shows that the entanglement yield Ep can be expressed as the same function 
as that for probabilistic cases. Though the probabilistic argument assumes that we 
can transform the initial state to the exact copy of a maximally entangled state with 
probability close to one, the fidelity argument is much more natural in that it imposes 
weaker restrictions on manipulation where the fidelity close to one, which is common 
to more general entanglement manipulations, such as entanglement dilution. 



5. Strong converse 



We have investigated how entanglement yield behaves when the failure probability 
exponentially decreases. In this section, conversely, we discuss asymptotic 
entanglement concentration with exponentially decreasing success probability, which 
will finally lead to the strong converse of asymptotic entanglement concentration. 
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The following discussion is parallel to that in the previous section. So, when we 
distil a maximally entangled state of size L* from \<fi)® n , the following relations also 
hold: 

P K =^min{t n ,p"(i)}, (66) 

i 

and 

P K = t„L* n . (67) 

We assume that the success probability converges to zero as the number of the 
entangled pairs n increases. Then, the first-order coefficient in the exponent of the 
success probability in the asymptotic limit becomes 

r = lim f logPj,. ) . (68) 

?woo \ n ™ J 

Intuitively, this means that the success probability behaves as 2~ nr . We will derive 
the maximum number of Bell pairs distilled per copy in the asymptotic limit, E* , as 
a function of the exponent r. First, we prove a theorem that relates entanglement 
yield and an exponent via a monotone function, from which we will derive a formula 
for entanglement yield E*[r). 

Theorem 10 Consider a sequence of entanglement concentration schemes that 
convert n identical copies of \(f>) — ^2f = i y/Pi\i)\i) , i-e-, |<^)®"> into a maximally 
entangled state of size L* n , which attain the optimal success probability Pl^- Suppose 

liminf (-logL^) >H(p), (69) 



n 



and 



n 

where p = (pi, • ■ -Pd)- Then, 



-logL,* <logd, (70) 



limsup ( - logL; ) = g ( limsup ( -- logP L - I I , (71) 



IT J \ n — *oo \ n 



and 



where 



linimf ( ilogi^ = 5 niminf (~logP L . ) ) . (72; 



g(r) ee max H{q). (73) 

q:D(q\\p)<r 



Proof. Let 



Then, equation mm gives 



R n ee -- log t n . (74) 
n 



--\ogP K =R n --logL* n . (75) 
n n 

In what follows, we only consider a convergent sub-sequence of {R n }, that is, take an 
infinite subset J\f C {1,2, . . .} such that R' ee lim„^ oc .„ eJ v' Rn exists. For simplicity, 
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we denote the sub-sequence {R n } n £Af a s {R' n }, and omit n G TV. Equations ( |69| ) and 
@ imply 

ff(p) < i?'. (76) 
First, we consider the case 

d 

#<- 5 X>gpi- ( 7? ) 

i— 1 

Then, for sufficiently large n, we have 
1 d 

<<-^E lo s^- ( 78 ) 

Equation ( |66| ) gives 



d 

i=i 



- ilogP L ., = -ilog^min{4,p"(i)}, (79) 



ft " n 



where {P L *'} and {i^} are sub-sequences related to {R' n } by equations fl74| ) and (|75|). 

In the following, we estimate the right-hand side of equation ( f79[) by the method 
of types. Rewriting the area below the truncating probability t' n in terms of the type 
theory, we obtain 



X ^min{Cp"(i)}= E l T <X + E P"( T <P- ( 8 °) 

qeV n :p n (q)>t' n qeV n :p n (q)<t'„ 



Invoking equations (|20D, (|23[) , fl24j), and (|25[) , we have the inequalities 

E + E p"( T <D 



~9 

qeV n :p n (q)>t' n qGV„:p n (q)<t'„ 



< 2 niHiq) - R '" ) + ^2 2~ nD{qlM (81) 

qeV n :D(q\\p)+H(q)<R' n qeV n :D(q\\p)+H(q)>R.' n 



< (n+l) d 



max 2 n{ - H{ - qS) ~ R '^ + max 2~ nD( - q ^ 

q£-P n :D(q\\p)+H(q)<R' n qeV n :D(q\\p)+H(q)>R' n 



(82) 



According to Lemma |l5j in Appendix B , setting S{q) = D{q \\ p) + H(q), 
T(q) = H(q), x = R, and noting that x\ — — logpi, and X2 — —d^ 1 ^2 i=1 \ogpi, 
we see that the function R i— > m.SiX q . D/ q \\ p )+H(q)=R H(q) is continuous and strictly 



Thus, 



monotone increasing in [0, logd] for R G — logpi,— d 1 Yli=i ^°SPi 

maXg e7 5 ti . (g|| p ) +J3 '( 9 )< H / ( = ^3XqeV n :D{q\\p)+H(q)=R' n H (l) 

= maXqeV n :D(q\\p)+H(q)=R' n [R'n ~ D {l II P) ] ■ 

Similarly, according to Corollary [l6] in Appendix B| , setting S(q) = —{D(q \\ 
p) + H(q)}, U(q) = D(q \\ p), x = —R, and noting that X\ — \ogpa, and 
X2 = —H(jp), we see that the function —R i— > vomq-.D(q\\p)+H(q)=R D(q \\ p) is 
continuous and strictly monotone decreasing in [0, — \ogpd] for —R G [logpd, —H(p)]. 
This means that the function R i— > min-q-.D(q\\p)+H(q)=R D{q \\ p) is continuous 
and strictly monotone increasing in [0, — logpd] for R G [H(p), — logpd]. Thus, 
wn g ev n :D(q\\p)+H(q)>R' n D(q || p) = ram qe p. D ^\\ p ) +H (q)=R' n D(q \\ p). 

Then, combining equations (|80|) and (p3) and using equation (p0|), we have 



V min{i' ,p n (i)} < 2(n + l) d max 2 - nD{q ^ . (83) 
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Together with equation (|79|), we obtain 

liminf | --logPr,/ ) > min D(q\\p). (84) 

n-oo V n ^» I q:D{q\\p)+H{q)=R> 



Next, we derive a lower bound of equation (p0[). For any q satisfying D(q \\ p) + 
H{q) = R', there exists a sequence of types q' n £ V n such that D(q' n || p)+H{q' n ) > R' n 
and linin^oo = <?, due to the condition R' n < —d" 1 J2i=i ^°EPi < — l°gPd- 

Invoking equation (psj), we have 

E W + E p n ( T ? ")> E p"( T 9 ") 

qeV n :p n {q)>t' n q^V n :p n (q)<t' n q£V n :p n (q)<t' n 

>p n ( T V 

- ^TW 2 ~ nDiq ' M - (85) 

Together with equations ( |79| ) and (p0|), we obtain 

hmsup (~logP L ,,) < D(q || p), (86) 
which holds for any q satisfying D(q || p) + -ff (<z) = R' ■ Therefore, 

limsupf logPr,/ 1 < min D(q\\p). (87) 

Equations (Q) and (|87j ) imply that the sub-sequence |— n _1 logP L „' | is also 
convergent, i.e., 

r' = lim f --logPr,- ) = min D(o II »). (88) 

n^oo V n ,l / q:D( q \\p)+H(q)=R' W " ^ V ' 

As stated above, the function R i— ► min g:£) ( g || p ) + ^f( g ) = jj. Z?(q || p) is continuous and 
strictly monotone increasing in [0, — logp^] for R £ [H (p) 7 — logp^]. Hence, the 
inverse function of equation (88) clearly exists for R £ {H(p), — \ogpd). However, 
now we consider the range restricted to equation (|77|), we obtain, for r £ (0, c) 
with c = D(u || p) being the relative entropy between the uniform distribution 
u = (1/d, ■ ■ ■ , l/d) and p, 

R'(r')= max {D(q \\ p) + H(q)} . (89) 

q:D(q\\p)=r' 

Furthermore, from equation (|75|), we see that if R' and r' exist, then the sub- 
sequence |n~ 1 logi* | is also convergent, i.e., E* = lim^oo ^n _1 logi*^ exists, 
and 

r' = R'-E*'. (90) 

Hence, 

E*' = R'{r') - r' = max {D(q \\ p) + H(q)} - r' 

q:D(q\\p)=r' 

= max H(q). (91) 

q:D(q\\p)=r' 

According to Lemma |l5| in Appendix B, setting S(q) = D(q \\ p), T{q) = H{q), x = r, 
and noting that x\ = 0, and X2 = c, we see that the function r i— > max g: £)(g||p) =r H(q) 
is continuous and strictly monotone increasing in (H(p),\ogd) for r £ (0, c), thus 

max H(q) = max H(q). (92) 

q:D(q\\p)=r q:D(q\\p)<r 
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Therefore, in the case of equation ([77|), we have < r' < c, H(p) < E* < logd, and 
E*' (r') = g(r'). (93) 
Next, we consider the case R' > —d^ 1 J2i=i ^°SPi- equation ( |70| ) provides 

lim sup ( - log L*' ) < log d. (94) 

Together with equation (|75|), we have 

lim inf ( log P r »/ ) = R' - lim sup ( - log L*' ) > i?' - log d. (95) 

n->oo y n " J n^oo \n J 



Hence, 



lim inf 

n — >oo 



1 \ 1 

- log P Li > - - lo § Pi - log = c. (96) 



d 

i=i 



Then, since the function E* (r') is monotone increasing, we have 
liminfn^oo (n^ 1 logi* ^ > \mi r ^ c E* (r') = g(c) = logd. Note that for r > c 
= D{u || p), g(r) = ma,x q . D ( q \\ p < )<r H(q) = \ogd. Hence, together with equation (| 
we have 

log d — lim [ — log L* n ] = g (lim inf ( — — log P L , 



for 



n — >oo \ n I \ n — > oo \ ri 



.9 ( lim sup ( -\\o%P L *> ) ) , (<>7l 



?7 



lim inf ( -ilogPjy I > c. (98) 



77 



The above argument holds for any convergent sub-sequence {R' n }- Since r is an 
exponent of success probability, the function E* (r) is monotone increasing. And, so 
is g{r). Therefore, equations (§3), (p7|), and (|98|) provide equations (|7l|) and (|72|). □ 



Theorem [L0] leads to the following corollary that gives the maximum asymptotic 
entanglement yield E* (r) under the requirement that the success probability decreases 
as slowly as 2~ nr : 

Corollary 11 Consider a sequence of entanglement concentration schemes converting 
\(j))® n into a maximally entangled state of size L* n with success probability P S ucccss, such 
that 

r > lim sup f - - log P s (") cess ] . (99) 



Let us denote the class of all such sequences by C*{r). Then, for r > 0, 

E* (r) = max lim sup | — log L* ) = max lim inf | — log LI 

C*(r) n^oo \n n ) C*(r) n-°c \n 

= max (100) 

<7:-D(g||p)<r 
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E*(r) 



log d 



H(p) 




Figure 4. Entanglement yield in entanglement concentration whose success 
probability exponentially decreases (strong converse). The horizontal axis 
represents the exponent of the success probability. The vertical axis represents 
the number of Bell pairs distilled per copy in the asymptotic limit: E*(r) = 
raa,x q: D(q\\p)<r H(q). E*(r) reaches the maximum value, logo!, at r = c = 
— log d — d~ 1 i l°g Pi > which is the relative entropy between the uniform 
distribution u = (1/d, ■ ■ ■ , l/d) and {pi}. The broken line represents the 
entanglement yield Ep(r) as a function of the exponent of the fidelity. At the 



value r = r' , where -£-E*(r) 



1 



Ep(r) starts to deviate from E*(r). 



Entanglement yield can be related to fidelity also in the high- entanglement regime, 
where the aim of the task is to convert \<fi)® n deterministically into some final (possibly 
mixed) state that is as close to a maximally entangled state |$t*) of size T* as 
possible. We here require that the fidelity F n between the final state and the maximally 
entangled state |$t*) decrease as slowly as 2~ nr ; namely, 

r > limsup ( -~logF„ J . (101) 

n — >oo \ n J 

Let us denote the class of all such sequences by C* F (r). The maximum asymptotic 
entanglement yield E* F (r) over C* F (r) can be obtained by reducing the problem to that 
of probabilistic schemes via the following two lemmata. 

Lemma 12 // the transformation \4>) — ► *s possible with probability t\ 7 then, 

for any T > L, there exists a deterministic transformation \<j>) ► p with fidelity 

($t|/o|$t) > £i£2 with e 2 = L/T. 

The proof is straightforward by considering the case p — £i|$l)( ( I ) l| + (1 — ei)p'> by 
noting that |($t|*l)| 2 = L/T. 

Applying this lemma to asymptotic sequences by setting £2 = 2~ nx where x is an 
arbitrary nonnegative number, we have the following: for any sequence of probabilistic 
schemes belonging to C*(r) with size {L^}, there exists a sequence belonging to 
C* F (r + x) with the size {T* = L* t 2 nx }. Hence, 

1 



max lim inf 

C* F (r+x) n—>oc 



logT„* >E*(r)+x, v x>0, 



(102) 
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and therefore 

max liminf f- logT* ) > sup {E*(r - x) + x) . (103) 

C* F (r) n->oo \n J <x<r 

Lemma 13 Suppose that there exists a deterministic transformation \<j>) > p with 

fidelity F — ($>t\p\&t)- Then, there exist an integer L < T and a transformation 
\<j)) > \&l) with success probability P, satisfying 

, VTF - 1 

(104) 



Proof. See Appendix C 



Using this lemma, we derive an upper bound of ma,xc F ( r ) limsup^^^ (n -1 logT*). 
Consider a sequence of deterministic schemes where the fidelity between the final 
state of the n-th scheme and the maximally entangled state \&t*) is F n . Suppose that 

r > limsup ( --logF„ J , (105) 



n— >oc 

and 



= limsup ( - logT* I > r. (106) 



n — >oo 



n 



Let us take an infinite set of integers N such that lim n ^. oo;ne jv' ( n 1 log^n) = (3. 
According to Lemma |l3[ there exists a sequence of transformations \<f>)® n — > 
|$i»)(L* < T*) with success probability Psuccess satisfying 

p («) T * > \/ T ^ F n - 1 nn7 N 

Jsuccess-^n d. j^j • 

Since [3 > r, for any 6 > 0, there exists i%i such that for all n > n\, 

1 (log PtLsn + > I (log P„ + log T*) - 6. (108) 

Further, by equations (|l05| ) and ( |l06| ), there exists ri2 > such that for all n > n 2 
with n G J\f, 

- (log P S W CSS + logL;) > (3 - r - 6. (109) 
n 

Let us define r\ = - lim inf n _Kx,; ne- v logPj™ C cess) (or, r\ can be any 
accumulation value). Note that L* < T* and the above inequalities imply r\ < r. 
Then, for any 81,82 > 0, there are infinite values of n satisfying — i log Process < 
and n _1 logi* > /3— r + r\ — 6 2 . This means that E*(r% + Si) > (3 + n — r - 8 2 holds 
for any 61,62 > 0. Noting that E*(r) is monotone increasing, we have 

]imE*(x) > (3-r + n. (110) 

Note that what we have shown here is that for any sequence be long ing to Cp(r) 
and having [3 > r, there exists a value r±(< r) satisfying equation (jllfj). The same 
statement also holds for sequences with j3 < r, since the left-hand side of equation (110) 
is nonnegative and the right-hand side is nonpositive for r\ — 0. Hence, 

max lim sup ( — log T* ) < r + sup { lim E* (x) — r\ \ . (HI) 

C F (r) n^oo \n J 0<n<r [^l^l ) 
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Since E*(r) is continuous in (0, oo) and monotone increasing in [0,oo), 



equations (U03) and (111) determine E* F (r) as 



Ep (r) = max lim sup [ — log T* ) = max lim inf ( — log T* 

C* F (r) n~,oo \n J C F (r) n^oo \n 

= sup {E*{x) +r- x}. (112) 

0<x<r 

The function E* F (r) is shown by the broken curve in figure |[ Note that £pE* goes 
to oo at r — > and that it tends to as r — > oo. The value r' in the figure satisfies 
■^E* = 1 and from this point E* F {r) starts to deviate from E*(r) and increases 
linearly. 

In contrast to E* (r) , Ep (r) can reach any large value by making r large enough. 
This is because we are here allowing an exponentially small fidelity. Indeed, for a 
separable initial state in which E*(r) = everywhere, we still have E F {r) — r, which 
corresponds to the fact that separable states can attain fidelity 2~ N to N Bell pairs. 
In the region of large r, the entanglement in the initial state contributes to the vertical 
offset of E* F (r), which is given by E* F (r') — r' . 

6. Alternative formulae 

Corollaries and [ll] give the entanglement yields E(r) and E*(r) as a minimum or a 
maximum over a probability distribution q. In practical calculation, however, it is not 
easy to deal with such minimization or maximization over probability distributions. 
We therefore present alternative formulae for entanglement yield, which contain only 
minimization or maximization over a variable s. 

The entanglement yields in Corollaries and [ll] are also expressed as 

E(r)= S u P r -±m, (113) 

8>l 1 - S 



and 



where 



E *(r) = min ^i + M, (H4) 

V ^ 0< S <1 1 - 8 



The proof of equations (|113|) and (114) is given in Appendix D. The entanglement 



yields in the fidelity arguments can also be expressed in the same manner: 

, i 1 - s 



£ F (r)=sup^±il£), (116) 



and 



Ei(r] = ) ™°<s<i^f (0 < (n7) 
r — r + E [r ) (r <r) 



where r = r 1 is a value at which - j-E* (r) = 1. Equations (116) and (117) are easily 



obtained from equations (pa) and (112 



As an example derived by using the above formulae, we show non-additivity of the 
entanglement yield E{r). Additivity is an important property of -Eentropy (p) = H(p) 
and Edet(p) = — logpi: Entanglement of a composite system is the sum of the 
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contributions from each system, e.g., E entIopy (p ® a) = Entropy (p) + E cntIopy (a). 
(Note that p and a represent pure states.) This property, however, does not hold for 
the general expression of entanglement yield, E(r), as seen below. 
We can prove the following inequality of E r : 

E ri+r2 (p(g> a) < E ri {p) + E r2 (a). (118) 



From equation (113), we have 

E ri+r2 (p <g> a) = sup - ^ ^ — — 

s>i 1 - s 



sup 

s>l 



+ ^p(g) r 2 + ^p(s) 
1 - s 1-s 



. ri+Vv( s ) . r 2 +Vv(s) 
< sup — - h sup — 

S>1 1 - s s>l 1-S 

= E ri (p) + E T2 {a). (119) 
Equality holds if and only if s + ( ri) p = s + (r2) a , where s+(r) > 1 is a unique solution 



of r = —ip(s) — (1 — s)ip'(s) (see Appendix B). Thus we obtain equation (118) 



When r\ = r 2 = r/2, equation (|118|) provides 

E r (p®o-)<Er(p)+Er(o-). (120) 

Let the largest squared Schmidt coefficients of p and a be p\ and q±, respectively. 
Equality holds if and only if p and a satisfy s+(r/2) p = s+(r/2) CT . Note that this 
condition is satisfied when r = or r > — 2 log(min{pi, qi}), which recovers the 
additivity of Entropy or Ea et , respectively. 
Hence, 

Er{p) = ±E r {p®p). (121) 



Substituting this into equation ( |120| ), we obtain 

E r (p<S><j) < -{E r (p® p)+E r (a®<r)}. (122) 

This means that the entanglement yield in collective distillation of different states 
does not exceed the average of those in separate distillations of each state. For p and 



a such that s+(r/2) p ^ s+(r/2) a , the left-hand side of equation (122) is strictly less 
than the right hand side. 

On the other hand, consider the independent concentration of p and a with the 
same error exponent r. The failure probability of the whole process is 

1 - (1 - 2~ nr ) 2 = 2~ nr+1 - 2~ 2nr . (123) 



Thus, the error exponent of the whole process is also r (see Appendix A ) . Therefore, 



E r (p®<7) > E r (p) +E r {a), (124) 



which means that E r is generally non-additive. In fact, equation (121) provides 

E r (p®p) = 2Er(p)>2E r (p) for r<-21ogp 1 , (125) 
thus, the inequality in equation (|124|) is strict in this case. 



To sum up, we obtain the following relation on the general entanglement yield of 
composite pairs: 

E r (p) + E r (a) < E r {p®a) < ~ {E r (p <g> p) + E r (a <g> a)} . (126) 
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7. Conclusion 

We have discussed entanglement concentration with exponentially decreasing failure 
probability, as well as fidelity exponentially close to one, in the asymptotic limit. By 
employing the method of types, we derived the entanglement yield E(r) as a function 
of an error exponent r. The result fills the gap between the well-known least upper 
bound of entanglement yield represented by entropy and the maximum attained in 
deterministic concentration. The explicit dependence on the exponent of the success 
probability and of the fidelity was also presented, for the large-yield regime, in the 
form of a strong converse. 

In entanglement manipulation as well as other types of quantum information 
processing, deterministic, probabilistic, and high-fidelity transformations are 
considered. Our results represent a common generalization and refinement of all 
three approaches. They provide a unified view of probabilistic and deterministic 
transformations, and show that success probability and fidelity are essentially 
equivalent concepts; to be precise, high fidelity and high probability result in the 
same yield function, while the yield function for low fidelity coincides with that for 
low probability only for small exponents: for large exponents the latter saturates 
whereas the former becomes a straight line. The power of the error rate approach (i.e., 
of quantifying 'rare events') in information theory is demonstrated also in quantum 
information theory. 
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Appendix A. 

Lemma 14 If lim^oo a n < lim^^ b n , then 

lim -ilog(2-" a " +2- n6 ") = lim a n . (A.l) 

n^oo Tl n— >oo 



Proof. 



lim -- log(2-" a " + 2-™ b ") = lim -- log h- na " ( 1 + 2- n ^- a "A ) 

ra— >oo n n— >oo n I V /J 

= lim (a„-ilog(l + 2-"( b "- a ^l 

n— >oo |_ n V /J 



lim a n . (A.2) 

n— >oo 

□ 
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Appendix B. 

Lemma 15 Let V be a convex subset o/R n . Let S : V — > R be a continuous function, 
and let T : V — > R be a strictly concave function. Suppose that T takes its maximum 
at q 2 6 V and Sfa) — X2, and that min ge -p S(q) = X\. Then, 

R(x) = max T(q) {x x < x < x 2 ) (B.l) 

q:S(q)=x 

is strictly monotone increasing. 



Proof. Consider arbitrary x 1 and x" such that x\ < x' < x" < x 2 . Then, there 
exists q' such that R(x') = max q .g^ =x i T(q) = T(q') and S(q') = x' . Due to the 
continuity of S(q), there exists q" such that S(q") = x" . Since the domain V is a 
convex set, we have 

q" = Xq 2 + (1 - X)q' (0 < A < 1). (B.2) 

Then, the strict concavity of T(q) and our assumption provide 

T{q") > XT(q 2 ) + (1 - X)T(q') > T{q'). (B.3) 

Note that the strict concavity of T ensures T(q 2 ) > T(q'). Together with T(q") < 
maxg. s(9 ) =x " T(q), we have 

R(x') < R{x") for x 1 < x" . (B.4) 

□ 

We can easily derive the following corollary on convex function. 

Corollary 16 LetV be a convex subset o/R". Let S : V — > R be a continuous fuction, 
and let U : V — > R be a strictly convex function. Suppose that U takes its minimum 
at q 2 G V and S(q 2 ) — x 2 , and that min gg -p S(q) = X\. Then, 

Q(x) = min U{q) (xi < x < x 2 ) (B.5) 

q:S{q)=x 

is strictly monotone decreasing. 



Appendix C. 

This appendix gives the proofs of the lemmata used for reducing fidelity arguments to 
probabilistic ones in Sees. ^ and ||. Note that these lemmata are not confined to the 
analysis of i.i.d. entanglement concentration, but apply in the most general situations: 
In fact, they apply whenever we are only interested in the rate of the entanglement 
produced and of the error probabilities (fidelities). 

Proof of Lemma ^ According to Theorem 3 in reference the optimum fidelity 
is always achieved with a deterministic transformation that actually produces a pure 
state. So, our assumption implies that \4>) can be deterministically transformed into 
a pure state |</>') such that |(<I>t| </»')! 2 — 1 ~ e - We may assume (after suitable local 
rotations) 

i 

|*r>= £4=1^)' (C.2) 
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because this will only increase the fidelity. 
Then our assumption becomes 

2 



EvR^>(|vS^) 



>l-e, (C.3) 



from which we directly obtain 

|(^---L) 3 < 2 , (C.4) 

Suppose 

(1 + V2) 2 

Pi > P2 > ■ ■ ■ > PK > j, > PK+l > ■ ■■ >PT- (C.5) 

(Note that if p x < (1 + y/2) 2 /T < 6/T, then we can deterministically obtain \$l) 
with L = \T/6\ by the deterministic entanglement concentration Ji"3| reviewed in 
section 0.) Then, 



1 T ( 1 \ 2 \ K ( 1 \ 2 1^2 IK 



t~t^V a/t; -t^V a/TV ~t^t t 2 

Introducing /J = if/T < e, and defining S = our aim is to bound the latter 
probability. 

Note that with the restriction that the largest K probabilities pi sum to 5, the 
fidelity to |$t) is maximized for 

Pi = ---=Pk = j7, (C.7) 

PK+l = ■ ■ ■ = PT = y _ R , (C.8) 



yielding 



1-C< W,$T) <Fma X ((5,^) 



# Vr v ' \ t-k^TJ 

<V5y/p + VT~5^1~]3. (C.9) 
If we denote the right-hand side by 1 — ?y, we can solve for \fE, and obtain 

VS = y/0(l - V )± y/(l - 13)7,(2 - T,) 

<(l + V^)Ve- (CIO) 
Hence, 6 < 6e, and we can execute the following probabilistic protocol: First, Alice 
and Bob observe whether the state is in the subspace spanned by .. . , \K) or in the 
orthogonal complement, using a projective measurement. The first result occurs only 
with probability 8 < 6e. In the other case, the post-measurement state has Schmidt 
coefficients p\ = Pi/(1 — S). Hence, 
6 



T(l - 6e) 



>p'k+i>Pk+2>---, (Cll) 
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so by the deterministic entanglement concentration fll3| , we can obtain 
deterministically, with L = [T(l — 6e)/6j. Therefore, a pure state \(j>'), into which 
\4>) is deterministically transformed, can be transformed into with probability 
1 - 6e. □ 



Proof of Lemma 13 According to Theorem 3 in reference pa], the optimum fidelity 



is always achieved with a deterministic transformation that actually produces a pure 
state. So, our assumption implies that \4>) can be deterministically transformed into 



a pure state \<f>') such that |($ T |0')| 2 = F. 
the function p(x) defined by p(x) = pi [i — 
fidelity F to the state |$t) satisfies 

cT 



Let = 

1 < x < i) 



p(x) = 



/TF = ^pl + J^ ^/pjx)d2 



/p~i\i)\i), and introduce 
(M < x). Then, the 



(C.12) 



Substituting x 




ln(T) 



XV p(x)ds < ln(T) max 

_ 0<x<T 



^V*) • (C13) 



On the other hand, for any x, the probabilistic concentration with truncation value 
p{x) gives a probabilistic transformation into with success probability P with 
L > x and P > xp{x). (Note that P = p{x)L due to equation ( |l7| ) in section ||.) 
Hence, y/PL > x\/p(x). 

Combining those, we arrive at the conclusion that there exists a probabilistic 
scheme for some L <T satisfying 



ln(T) 



ln(T) 



(C.14) 

□ 
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Proof of equations (113) and (114) 



Consider the function 

F{s) = -1>(s) - (1 - s)V'(s) («>0), (D.l) 

where 

d 

1(s)=lo E J2ph (D.2) 

i=l 

and pi is a probability distribution. Differentiating these functions, we have the 
following relations: 

1'(s)=J2hi(s)\og Pi , (D.3) 

i 

where hi(s) is a probability distribution such that 

h(s) = (D.4) 
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^"(s) = ln2|^^( S )(log K ) 2 - ^>i(s)logp^ |>0, (D.5) 

and 

( > (s > 1) 

F'(s) = -(l-s)r(s)\ =0 {a = l) (D.6) 
[ < (0 < s < 1). 

(Note that Inx denotes natural logarithm.) Since F(l) = 0, lim^oo F(s) = — logpi, 
and F(0) = D(u \\ p) = c, where u = (1/d, ■ ■ ■ , 1/d) is the uniform distribution, there 
exist unique s+(r) > 1 and < s_(r) < 1 such that 

f F(«+(r)) re(0,-logpi) , . 

~ \ F(a_(r)) r e (0,c). 1 J 

Let g and /i(s) be probability distributions such that D(g || p) = D(h(s) || p) = r. 
Then, we have 

H(h(s)) - H(q) = - D(h(s) \\ P )-J2 h ^ lo ^ + D (9 II P) + E * lo §^ 

i i 

= ^ {*-/*(«)} logPi, (D.8) 

i 

and 

|| = J_ || />(*)) + D(h(s) || p) - D(q || p)} 

1 — s 1 — s 

= — {-?» log ftj(s) + /i»(s) log /ij(s) - /ij(s) logp, + % logpj 
1 — s z — ' 

i 

= < (i - *) E(* ~ logPi + ~ ^ s )) lo sZ]^ f 

[ » * i ) 

= 5^{</i-M«)}logpi. (D.9) 



Thus, 



which gives 



*T(/i(*)) - #(<?) = ^-D(« || Ms)), (D.10) 
1 — s 



H{h{s+{r))) <H{q) and H(h(s-{r)) > H(q). (D.ll) 
On the other hand, 

F( S ) = - log£>* - (1 - s) ^ ^(,s) logp, 
j » 
= ^^(s)log^(s) - ^/i J (s)logp l 

= £>(&(*) ||p), (D.12) 

hence, 

D(/»(*±(r)) ||p)=F(s±(r))=r. (D.13) 
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Therefore, 

s±r + ip(s±) _ s±F(s±) + i/j(s±) 
1 — s± 1 — s± 

= - 0(s±) ~ S±1p'(s±) 
i i 

= -/]hj(s±)log hj(s±) 

i 

= H(h(s±)). 

From equations ( D.ll ), ( D.13| ), and ( D.14 ), we obtain, for r e (0, — logpi), 

s + r + ifj(s+) 
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min H(q) 

q:D(q\\p)=r 



1-s, 



i.e., 



min {D(q\\p)+H(q)} 
q:D{q\\p)=r 



1-8+ ' 



and, for r 6 (0, c), 



s_r + ^>(s_) r + V(s_) 
max ii (q) = = r. 

q:D(q\\p)=r 1 — S_ 1 — S_ 



Next, consider the function 

r + %j){s) 



Then, 



Since 



we have 



G{s) 
G\s) 
F(s) 



1-s ' 

r - F(s) 

> r (s < s_, s + < s) 
< r (s_ < s < s + ), 



> (s_ < S < 8+) 

G'(s){ =0 (s = s+) 

< (s < s_, s + < s). 

(Note that we take s+ = oo for r > — logpi.) Thus, we obtain 

maxG(s) = G(s+(r)) for r € (0, — logpi), 

S>1 



and 



sup G(s) = lim G(s) = — \ogp 1 for r > — logpi. 

S>1 



(D.14) 
(D.15) 

(D.16) 
(D.17) 
(D.18) 
(D.19) 
(D.20) 

(D.21) 

(D.22) 
(D.23) 



As discussed in section ^, the left-hand side of equ ation ( D.16 ) is monotone decreasing 
with respect to r. Thus, together with equation (D.18), we obtain 



min {D(q\\p) + H(q)} = G(s+) for r e (0, - logpi). 

q:D(q\\p)<r 

Therefore, equations ( p.22[ ), ( p.23| ), and ( p.24| ) give equation ( pT3| ). 



(D.24) 
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On the other hand, for r £ (0, c), we have s_(r) > 0, thus 



hence, 



min G(s) =G(s_(r)), 

0<s<l 



mm = mm G(s) — r = G(s_(r)) — r. 

0<s<l 1 — s 0<s<l 



In addition, for r > c, we have s_(r) < 0, thus 
hence, 



min G(s) = G(0) = r + logd, 

0<s<l 



mm = mm G s — r = log a. 

0<s<l 1 - S 0<s<l 
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(D.25) 

(D.26) 
(D.27) 

(D.28) 



As discussed in section^, the left-hand side of equation (D.17) is monotone increasing 
with respect to r. Thus, together with equation (D.18), we obtain 

max H{q)=G{s-)-r for r £ (0,c). (D.29) 

q:D(q\\p)<r 



Therefore, equations (D.26), (D.28), and (D.29) give equation (114). 
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